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^Tp Abstract Monte Carlo event generators are essential components of almost all ex- 

^h perimental analyses and are also widely used by theorists and experiments to make 

^j" predictions and preparations for future experiments. They are all too often used as 

C^ "black boxes", without sufficient consideration of their component models or their 

reliability. In this set of three lectures we hope to open the box and explain the phys- 

C"| ical bases behind the models they use. We focus primarily on the general features of 

Qh parton showers, hadronization and underlying event generation. 

Oh 

0) 

| — ' 1 Motivation and Overview 

^ Monte Carlo (MC) event generators are very widely used, especially by experimen- 

talists in analyses but also by many theorists, who use them to make predictions for 
collider experiments and to develop techniques to propose to the experiments. MC 
are extremely important tools in High Energy Physics but unfortunately they are 
often used as "black boxes" whose outcome is treated as data. The aim of these lec- 
tures is to explain the physics behind event generators, which are mostly common 

^r^ between event generators but some differences will be highlighted. 

t-H As an example of the importance of MC, the majority of the recent Higgs discov- 

ery plots rely very strongly on MC predictions, to set limits on Higgses in certain 
parameter space regions as well as to discover them. This should be motivation 

/\ enough to show that we need event generators for doing discovery as well as pre- 

cision physics. Figure 1(a) shows the ATLAS diphoton invariant mass distribution 
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consistent with a Standard Model Higgs boson of 126 GeV. One might ask if event 
generators are really still necessary when a distinct bump such as this one is visi- 
ble. The answer is certainly yes, for example to quantify the significance of such a 
resonance and understand what particle it is. In the H — >• yy channel the resonance 
sits on a very steeply falling background where event generator predictions might 
be less important but all other discovery channels rely extremely heavily on event 
generators. Figure 1(b) shows the CMS four lepton invariant mass distribution from 
the "Golden Channel", where MC predictions are crucial for signal and background 
modelling. 



° 3500 
J 3000^ 
9? 2500^ 

LU Z 

2000^ 
1500^ 
1000^ 
500^ 



ATLAS * Data 



Fit (m H =1 26.5 GeV) 
(4th order polynomial) 




/s=7TeV,JLdt=4.8fb- 1 

(s=8 TeV, Jl_dt=5.9fb _1 H ^YY 



_§> 200 
m 100 - 



° 

-100 
-200 

100 



f i+i i i*+- T K.:. \ v» * V* 



110 120 130 140 150 160 
m„ [GeV] 




(a) 



80 100 120 140 160 180 

nv (GeV) 

(b) 



Fig. 1 Invariant mass distributions of (a) ATLAS H -> yy and (b) CMS H -> ZZ -> 4£ candidates 
for the combined y^ = 7 TeV and yfs = 8 TeV data samples. Reproduced from [1,2]. 



The structure of a proton-proton collision at the Large Hadron Collider (LHC) as 
built up by event generators can be described by a few main steps. These are illus- 
trated in Figure 2 where two protons come in from either side and make a collision. 
The colour coding corresponds to the steps into which most event generators divide 
the process: 

1. Hard process 

2. Parton shower 

3. Hadronization 

4. Underlying event 

5. Unstable particle decays 

The first thing an experimentalist notices when studying proton-proton collisions is 
that most of them are "boring" in the sense that only a few soft hadrons are produced 
and most of the event goes out along the beam pipe direction. Only a tiny fraction 
of events contain a high momentum-transfer process of interest. It is therefore not 
feasible to simulate all possible proton-proton collisions but the simulation needs 
to be structured with a focus on deciding what hard process is wanted (a bit like 
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Fig. 2 Diagram showing the structure of a proton-proton collision, where the different colours 
indicate the different stages involved in event generation. 



triggers at experiments which decide which events to write to tape and which to 
discard). 

This is done by starting the simulation at the heart of the collision and calculating 
from perturbation theory the probability distribution of a particular hard scatter, 
which is the highest momentum transfer process in the event. Simulating the hard 
process is relatively straightforward because Parton Distribution Functions (PDFs) 
describe partons coming into the process and lowest order perturbation theory gives 
a probabilistic distribution of the outgoing partons. 

A more interesting stage of event generation comes from asking what happens to 
the incoming and outgoing partons involved in the hard collision. This is described 
by the parton shower phase of event generators. The partons involved in the hard 
process are coloured particles, quarks and gluons. From Quantum Electrodynamics 
(QED) it is well known that scattered electric charges radiate photons, this is what is 
called Bremsstrahlung. In the same way, scattered colour charges radiate gluons and 
this happens for partons on their way in and out of a collision. The main difference 
to QED is that, due to the non-Abelian structure of 51/ (3), gluons themselves are 
coloured and so an emitted gluon can itself trigger new radiation. This leads to an 
extended shower and the phase space fills up with (mostly) soft gluons. The parton 
shower can be simulated as a sequential step-by-step process that is formulated as 
an evolution in momentum transfer scale. The parton shower evolution starts from 
the hard process and works downwards to lower and lower momentum scales to a 
point where perturbation theory breaks down. 

Here it is necessary to switch to hadronization models, which take account of 
the confinement of a system of partons into hadrons, which are seen in the detector. 
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As well as the confinement of the produced partons, it is important to remember 
that the initial, uncoloured proton has had a coloured parton taken out if it and so 
it has been left in a coloured state. To get an idea of the space time structure of a 
collision, consider the fact that in a proton's own rest frame it is a spherical bound 
state, but in the lab frame the two protons are moving towards each other at very 
high speed and the Lorentz contraction flattens them into extremely thin pancakes. 
The collision happens at a point where these flat discs are completely overlapping 
each other in space time and so there is a very high probability that there will be 
other interactions apart from the hard interaction. This gives rise to the underlying 
event, which is made up of secondary interactions between proton remnants. It pro- 
duces soft hadrons everywhere in the event, which overlie and contaminate the hard 
process that was already simulated. 

The last component of event generation, which is usually not discussed in as 
much detail, is the fact that many of these hadrons are not stable particles but heavy 
resonances that then go on to decay. A lot of improvement has been made in the last 
five years to model these secondary particle decays. 

Although some details differ, this brief overview of a process from hard collision 
to stable hadrons is effectively used by all current general purpose event generators, 
i.e. Herwig, Pythia and Sherpa. The lectures are organized into three main parts: 
parton shower, hadronization as well as underlying event and soft inclusive physics 
models. More details can be obtained from [3]. The classic textbook on the subject 
is [4]. 



2 Parton showers 

The basic idea of the parton shower is to set up in a probabilistic way a simulation 
of the cascade of partons that is produced by the colour charges that are accelerated 
in a scattering process, or created or annihilated in a pair creation process. The 
simulation of final state radiation (FSR) will be discussed, namely what happens to 
the partons as they leave the hard collision. Finally, it will be shown that the main 
ideas for FSR can also be applied to initial state radiation (ISR). 



2.1 Divergence ofQCD Emission Matrix Elements 

First we want to look at the simplest, non-trivial Quantum Chromodynamics (QCD) 
process one can study, e+e~ annihilation to jets. The tree-level cross section for 
e + e~ annihilation to two partons (qq) is finite and, from a QCD point of view, does 
not have much interest. However, the first correction to this process, namely e+e~ 
to three partons is already very interesting. It is a good example of the more general 
statement that almost all QCD matrix elements diverge in different regions of phase 
space. It is the need to understand these divergences that will lead to the parton 
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shower description of FSR. If we want to calculate the distribution of three partons 
in the final state, we need to sum two Feynman diagrams, shown in Figure 3, at the 
amplitude level and then square them. 





Fig. 3 Feynman diagrams for the process e+e —> qqg. 

One can calculate the differential cross section and write it, as shown in Equa- 
tion 1 , in terms of the opening angle 6 between the quark and the gluon, the energy 
fraction of the gluon z g = E g /E g ^ max , the total e+e~ — )> qq cross section Ob, the quark 
charge squared Cp and the QCD running coupling constant a s ~ 0.1, 

do a s 2 l + (l-z g ) 2 

CTO^F^ T^~Z • (1) 



dcos0dz g In sin 2 

This formula has several interesting features. It has a factorized form as it is propor- 
tional to Go and so one can think of this as a two step process: first the e + e~ makes a 
two parton system, which in turn produces an extra gluon. Another important point 
is that it is not possible to calculate the whole probability as it is divergent in the 
collinear limit (6 — >> 0, 7t) 1 and in the soft limit (z g — » 0) 2 . 

First we think about the physics of the collinear limit G — >■ 0, % of QCD matrix 
elements. Parts of the previous equation can be separated into two pieces 

2dcos0 dcos0 dcosfl 

(2) 



sin 2 6 1 — cos 6 1 + cos G 
dcos0 dcosS 



1 — cos 6 1 — cos 6 
dG 2 dO 2 



(3) 
(4) 



where 6 = it — 6 is the angle between the antiquark and the gluon 3 . From the middle 
line, we can see that we have separated this into two independent terms, the first 



1 Assuming massless quarks; the massive case will be discussed later. 

2 It should be noted that here we parameterized the kinematics in the rest frame of the virtual 
photon, i.e. the rest frame of the total hadronic, three parton system. This might make it look like 
it is not Lorentz invariant, but one can show that the energy and angle dependence will always 
conspire in such a way that the final distributions are frame independent. 

3 This is an approximation that becomes exact in the limits of collinear or soft emission that we 
are interested in. 
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(second) term is only divergent in the 6 — )> (6 — )> 0) quark (antiquark) limit. These 
terms can be approximated again to expose more clearly that this is a logarithmically 
divergent distribution. We have written this in a manner that appears sequential: in 
a first step the qq pair is produced and in a second step the gluon is radiated. The 
probability distribution of this gluon was separated out into the sum of two pieces, 
where one is associated to the quark direction and one to the antiquark direction. 
Rewriting the differential cross section, we can think of this as a system where each 
jet is evolving independently, each of which has a collinear factor: 

Here, we have set this up in terms of the opening angle 6 as it is convenient, but 
we could build something of the exact same form that is proportional to 2 , e.g. the 
transverse momentum of the gluon relative to the qq axis, 

kl=z 2 (l-z) 2 2 E\ (6) 

or the total invariant mass of the quark gluon system, 

q 2 =z(l-z)0 2 E 2 , (7) 

so that 

dfl 2 _ d/^ _ d^ 

2 k\ q 2 ' w 

The choice of this variable is one of the important differences between various par- 
ton shower algorithms. In the limit of — )> 0, k± —> 0, q — )> 0, all these variables 
give the same leading approximation to the full cross section, Equation (5), so in 
describing the cross section with leading accuracy they are equivalent and this is 
formally a free choice. However, each involves different sub-leading corrections to 
the leading approximation, so the choice can, and does in practice, have important 
consequences for the distributions produced. 



2.2 Collinear Limit 

In Equation 5, the differential cross section was written in a factorized form <7o times 
the sum over all hard partons that are involved in the process. One can show that this 
is a universal feature of QCD matrix elements and not just unique to this e + e~ case. 
This differential cross section can be written in a universal way for an arbitrary 
hard process 

a s d0 2 , , L x d0 



Monte Carlo Event Generators 7 

where z is the energy fraction of the parton, (j) is the azimuthal angle of the splitting 
around the axis of the parent parton and P(z, 0) is known as the Dokshitzer-Gribov- 
Lipatov-Altarelli-Parisi (DGLAP) splitting kernel, which depends on flavour and 
spin. To give a few spin-averaged (no (j) dependence) examples of the latter, 

p q -+ qg (z) = c F Y^, (io) 

P q ^ gq {z)=C F l " r{l ~ z) , (11) 

z 

P g^gg(Z) = C A z (\- z ) ' (12) 

P g ^ q - q ( z ) = T R (z 2 + (l-zf). (13) 

To use the collinear limit, we do not have to take the partons to be exactly 
collinear, it is sufficient that the opening angle is much smaller than any angles 
involved in the hard process. In this limit we get this universal behaviour. 

However, we still cannot describe this process probabilistically because Equa- 
tion (9) still diverges when integrated over all possible angles. To understand the 
physics behind this divergence, let us take a step back and think about what we 
mean by having a parton in the final state of our process. As we will discuss later, 
partons produce jets with a finite spread of hadrons. The hadronic state produced 
by two exactly collinear partons is identical to that produced by a single parton 
with their total momentum (and colour). Without yet going into the details of the 
hadronization process, let us assume that there is some value of momentum of one 
parton transverse to the axis defined by another, below which they cannot be re- 
solved 4 and only calculate the probability distribution for resolvable partons. That 
is, we introduce an arbitrary parameter, go> which describes whether two partons 
are resolvable from each other or not. If k± > Qq, we call them resolvable and use 
perturbation theory to describe them as two separate particles. If k± < <2o we say 
that they are indistinguishable from a single parton with the same total momentum. 

Now we can calculate the total probability for resolvable emission, which must 
be finite. Since unresolvable emission is indistinguishable from non-emission, we 
must add together the virtual (loop) correction to the original process and the integral 
of the emission probability over the unresolved region. Each is divergent, but the 
virtual divergence is negative and exactly cancels the real divergence. So although 
each of them is divergent, their sum is finite and obeys unitarity: 

P(resolved) + P (unresolved) = 1 . (14) 

It is important to note that this fact is derived from Quantum Field Theory (QFT) 
and is not just assumed. One can encode the non-emission probability by something 
called a Sudakov form factor (SFF), which is a key ingredient of the MC. This SFF 



4 Note that we are not talking about our experimental ability to resolve jets, but an in principle 
indistinguishability of exactly collinear partons. 



8 Michael H. Seymour and Marilyn Marx 

represents the probability that a given parton does not radiate resolvable gluons and 
has an exponential form. The probability of emission between q 2 and q 2 + dq 2 is 



"-^""-"M"^ 



a- 



(15) 



and the probability of no emission between Q 2 and q 2 is defined to be A(Q 2 ,q 2 ). 
This gives the evolution equation 

dA(Q 2 ,q 2 ) A(n 2 2^ n ,. 

—^—=A(Q,q)-^, (16) 

^A(Q 2 ,q 2 )=cxp-j 2 -^Pik 2 ). (17) 

This has a very similar form to the well-known formula of radioactive decay, where 
an atom has a (constant) probability A per unit time to decay: 



P(no decay after time T) = exp — / dt X . 



(18) 



The Sudakov form factor, A(Q 2 ^Qq)=A (Q 2 ), represents the probability of emit- 
ting no resolvable radiation at all: 

VeVexp-C^log 2 ^, d9) 

which becomes very small for large Q 2 , reflecting the fact that a quark formed at a 
high scale is extremely unlikely to be unaccompanied by any gluons. 



2.3 Multiple emission and running coupling 

We can use the universality of the DGLAP splitting function to calculate the prob- 
ability that, given an initial emission, a certain gluon radiates another gluon that is 
more collinear than the first one. This way we can attach more and more partons, as 
is shown in an example in Figure 4. We can take the different building blocks that 
we have just derived, namely the tree-level splitting function and the SFF, which 
tells us the non-emission probability, and use them to construct the probability dis- 
tribution of any complicated final state. It should be noted that one important point 
that needs to be specified is the initial condition. It tells us how large the initial value 
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q\ of the evolution variable can be. This is the only process dependent factor in the 
parton shower and we will come back to this later 5 . 




Qi >q£-.. 



Fig. 4 Diagram showing multiple gluon emission off an initial quark line. 



To quickly touch on higher order loop corrections to emitted gluons, one can 
absorb a tower of higher order logarithmic corrections by replacing a? by a s (kj). 
This is because at each higher order, of, one encounters terms like j3q ln n k^/fl 2 , 
where ft is the leading coefficient of the QCD beta-function and \i is the renormal- 
ization scale at which a s is defined. If jl 2 is very different to k\ these terms are very 
large, spoiling the convergence of perturbation theory. But if the emission vertex 
is evaluated with a? replaced by a s (1Cj_), these terms are absorbed and effectively 
resummed to all orders. Thus, taking account of the increasing coupling at small 
kj_, the parton evolution is expected to fill the phase space with soft (low transverse 
momentum) gluons as it becomes increasingly "easy" (i.e. the probability becomes 
high) to emit very soft gluons. This further means that the go parameter is a very im- 
portant physical parameter that constrains the parton shower algorithm. In order to 
use perturbation theory, one has to ensure that go is much larger than Aqcd, which 
is ^ ^(200 MeV), and so it should be of order 1 or a few GeV. 



2.4 Soft limit and Angular ordering 

Now we want to move from the collinear limit to the soft limit, which is the other 
limit in which QCD matrix elements diverge. There is also a factorisation theorem 
for the soft limit but it has a very different form in the sense that it is universal only 
at the amplitude level, not at the cross section level. 

Consider a quark coming out of a hard process that radiates a hard gluon. We 
want to know what the distribution of soft gluons radiated from this system looks 
like. Going back to QFT, there are two possible Feynman diagrams that contribute, 
which we illustrate by the single diagram in Figure 5(a). The soft gluon is attached 



5 One can think of this initial condition as anything that parametrizes the "collinear-ness" of an 
emission process, e.g. the virtuality, how far off-shell this particle was. If we do not know how a 
coloured particle was produced, we cannot know how it radiates. 
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to the hard gluon in one diagram and to the quark in the other. In either of these 
cases there is a factorisation theorem telling us that the amplitude for that process 
can be written as the amplitude to produce the hard gluon times a universal factor 
describing the radiation of the soft gluon. However, unlike in the collinear limit, we 
need to sum these two diagrams before we square them and since the amplitudes that 
they represent have similar magnitudes there will be quantum interference between 
them. At first this was thought to spoil the picture of independent evolution, as 
described before in the collinear limit. Actually this is not the case because the 
radiation from these two is coherent. If we sum up the two diagrams at the amplitude 
level and square them, the radiation pattern from this pair of partons is identical at 
large angles to the radiation pattern from a single quark with the same total colour 
charge and same total momentum as the pair of partons had if it was on-shell as 
shown on Figure 5(b). So at large angles, the gluons essentially only see the total 
colour charge, they cannot resolve the colour charges of the individual partons. On 
the other hand, when the opening angle to one of the hard partons is small, the 
corresponding single diagram dominates. We can incorporate this into our collinear 
parton shower algorithm by ordering in the opening angle, which will therefore 
correctly describe the soft limit. 

We can conclude that it is possible to construct a parton shower that can describe 
correctly both the collinear and the soft limits of QCD matrix elements by using the 
opening angle as the evolution variable and describing a wide angle gluon as if it 
was emitted before the internal structure of the jet has built up 6 . 





Fig. 5 Diagram describing the soft limit: the soft gluon may be emitted by either of the outgoing 
partons, at the amplitude level (a), but the sum of the two diagrams is as if it was emitted by a 
single parton with the same total momentum and colour (b). 



6 At this point, we assume that the quarks are massless. We will see that the picture does not change 
radically for massive quarks, although for quarks heavier than Aqcd the quark mass actually plays 
a similar role to the resolution scale that we have discussed, cutting off collinear emission. 
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FSR is fully inclusive in the sense that we want to generate the distribution of all 
possible parton radiation, for ISR the goal is different. Here, we want to be able to 
choose the hard process and ask what radiation this process is accompanied by. So 
even though the physics involved in ISR and FSR is essentially the same, we have 
certain kinematic constraints for ISR, e.g. we know x and Q 2 , and we therefore do 
not want to generate all possible distributions but only those subject to having a 
fixed parton momentum at the end of the process. As illustrated in Figure 6, we can 
reformulate the evolution as a backward evolution, which probabilistically undoes 
the DGLAP evolution equations. We start from a particular x and Q 2 point and work 
down in q 2 and up in x towards the incoming hadron, asking progressively, what 
is the probability distribution of radiation that accompanies a parton of this flavour 
and kinematics. In the end, one finds that this algorithm is identical to FSR, but with 
4(<2V) replaced by 4(GV)//K*,<7 2 )- 



Q 2 




Fig. 6 The green lines illustrate the flow of information in analytic solutions of the DGLAP evo- 
lution equation, which yields the value of the parton distribution function at a given value of x 
and Q 2 as a function of its values at some lower value of Q 2 , Qq, and all higher values of x. The 
red lines illustrate typical backward evolution paths that lead to the same x and Q 2 value: each path 
corresponds to one event and each corner on the path to one emitted parton. 
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2.6 Hard Scattering and Colour Coherence 

We need to set the initial conditions for parton showers and here the colour co- 
herence that we already talked about when studying the soft limit of QCD matrix 
elements, is important too. We take the example of quark-antiquark pair annihila- 
tion, say uu — )> dd. The rules of perturbative QCD tell us that the quarks are in the 
fundamental representation of SU(Nc = 3) and the gluons are in the adjoint rep- 
resentation, which has N^—l (= 8 for SU (3)) colours. Up to corrections of order 
1/Ng, we can think of a gluon as carrying a fundamental colour and a fundamental 
anti-colour label. In that approximation, the colour structure of this qq annihilation 
process looks completely different to the flavour structure (the "upness" is annihi- 
lated and becomes "downness", but the "redness" of an incoming quark gets trans- 
ferred onto the ^-channel gluon and thence onto the outgoing quark). The effect of 
the colour structure is best illustrated by contrasting a uu —> dd event in which the 
d quark goes in the forward direction of the u quark with a ud — )> ud event with 
the same kinematics. In uu — >> dd, the colour of the quark has only been scattered 
through a small angle and any emission from it is confined to angles smaller than 
the scattering angle. In ud — )> ud, the colours in the initial state annihilate each other 
and a new colour-anticolour pair is created. Emission from both lines fills the whole 
of phase space. In general, the colour line of any parton can be traced through the 
hard process to find the parton to which it is colour connected: its colour partner. 
Emission from each parton is confined to a cone stretching to its colour partner and 
the colour coherence limits parton radiation in certain regions of phase space. 

The CDF 3-jet analysis is a classic experimental example that demonstrates that 
colour coherence is a real effect. In this analysis, they required two hard jets with 
p ti \ > 1 10 GeV but only p t ^ > 10 GeV. To map out the kinematics of this third jet, 
they looked at the pseudorapidity and jet- separation distributions, which can be seen 
in Figure 7. In these plots 7 , Herwig has colour coherence built in and predicts a dip 
between the hardest and second hardest jet, which was also seen by the data. The 
other two MCs shown, Pythia and Isajet, did not have this colour coherence built in. 
In this case, radiation was allowed to go everywhere and for kinematical reasons the 
radiation actually prefers to go in the central region. This misprediction prompted 
the Pythia authors to provide a better model, which was then called Pythian- and 
had a partial treatment of colour coherence added. This feature is nowadays part of 
Pythia by default. 



2.7 Heavy Quarks 

So far we have only talked about the case where quarks are lighter than the confine- 
ment scale, for which their mass does not play an important role in their evolution 
because it is the confinement itself that provides a cut-off. But if we want to calculate 



7 The MC predictions here have been put through detector simulation. 
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Fig. 7 Distributions from the CDF analysis demonstrating colour coherence in three-jet events. 
R is the separation, in r\ — <f> space, between the second and third hardest jets and 773 is the pseudo- 
rapidity of the third hardest jet. Reproduced from [5]. 



the emission pattern from heavy quarks there is another colour coherence effect at 
work. Figure 8 shows the emission pattern of a final state quark. The massless case, 
drawn in green, goes as 1/0 and diverges as we go to smaller angles. The pattern 
from a massive quark with the same momentum, drawn in blue, becomes similar at 
large angle (this is again the colour coherence effect - wide angle gluons only see 
the total colour) but as we get closer to #o — ^ there is a smooth suppression and 
the true emission pattern turns over and goes to at small angles. This means that 
a massive quark does not radiate at all in its forward direction. This blue curve is 
what is implemented in most current event generators but sometimes people still talk 
about the "dead cone effect", which was the implementation in earlier generators. It 
is the somewhat brutal approximation of treating the massive quark as massless up 
to a certain angle and then just turning off radiation in the forward direction com- 
pletely. This can be shown to give the correct total amount of radiation, but can be 
seen to be a very crude approximation for the distribution of that radiation. 

More often a quasi-collinear splitting is used [6,7], which has a smooth suppres- 
sion in the forward region, 



&&>n. 



a s dq 



?a 



lj ^ lJ In q 2 zP 0^ij( z ^)' 



(20) 



For reference, we give the splitting functions for massive quarks and spartons 8 : 

C F 



l q^qg 



(z) 



1 



1 



H 

zq 2 



(21) 



Since the gluon is massless P g ^ gg is unchanged from equation (12). 
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Fig. 8 Number of radiated gluons, N g , as a function of the opening angle 6 for the massless and 
massive quark case. 
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2.5 Colour Dipole Model 



So far we have been talking about "conventional" parton showers where you start 
from the collinear limit of QCD matrix elements and modify it to incorporate soft 
gluon coherence. In the colour dipole model (CDM) the starting point is somewhat 
different as it tries to understand the soft radiation first and to then modify that 
in such a way that you also get the collinear limit right. The idea is to start from 
the large Nc approximation where a gluon is treated as a colour-anticolour pair or 
dipole. The emission of soft gluons from such a dipole is universal (and classical) 



da i 



a s (k ± ) d*i 



(25) 



Monte Carlo Event Generators 15 

where y = —log tan | is the rapidity. In this model, we think of colour- anticolour 
pairs as radiating, so the qq pair together radiate a gluon. This creates an additional 
colour line, separating the system into a qg dipole and a gq dipole, which go on to 
radiate further. This way subsequent dipoles continue to cascade and instead of a 
1—^2 parton splitting, like in parton showers, you have a 1 ^2 dipole splitting, 
which corresponds to a 2 — >> 3 parton splitting. 

One different feature of the CDM is that there is no explicit ISR. The hadron 
remnant forms a colour dipole with the scattered quark, which is treated like any 
other dipole, except for the fact that the remnant is an extended object. This radiates 
gluons, but since it is an extended object it does not radiate in its forward direction. 
The radiation looks like FSR from the outgoing remnant rather than ISR from the 
ingoing quark, but one can show that they are equivalent, with the suppression of 
radiation from the remnant interpreted as the suppression due to parton distribution 
function effects in ISR. 

Most of the parton shower implementations that have appeared in the last few 
years [8-12] are based on this dipole cascade picture [13, 14]. 



2.9 Matrix Element Matching 

The parton shower method is an approximation derived from QCD that is valid in 
the collinear and soft limits. It describes the bulk of radiation well but very often one 
uses event generators to search for new physics, to predict backgrounds or to model 
features of the signal processes and do precision physics, like e.g. the top mass 
measurement, measuring multi-jet cross sections etc. In these applications, you are 
not only interested in very soft and collinear emission but in systems of hard, well- 
separated jets. Therefore many of the applications of parton shower event generators 
are pushing them into regions of phase space where they are least reliable, i.e. away 
from the soft and collinear approximations and more into regions where fixed order 
matrix elements should describe those processes better. In order to improve their 
predictions, one would like to get simultaneously (at least) next- to-leading order 
(NLO) normalization, a good description of hard multi-jet systems but also match 
that with a good parton shower of the internal structure of those jets - i.e. the best 
of all worlds. Achieving this is known as matrix element matching and is one of the 
areas where MCs have developed the most in the last five years. Several methods 
have been proposed to combine tree-level matrix elements for several jet multiplici- 
ties simultaneously, with parton showers to describe the internal structure of the jets 
and the pattern of soft radiation between the jets (the "intrajet" and "interjet" event 
structure respectively) without double counting (the buzz- words are CKKW [15,16] 
or CKKWL [17] and MLM [18] matching). Alternatively, two methods have been 
proposed to combine lowest-multiplicity NLO matrix elements with parton showers, 
again without double counting (MC@NLO [19] and POWHEG [20]). The current 
state of the art is progress towards NLO multi-jet matching [8, 21-23], which is 
needed for many applications at the LHC. 
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2.10 Summary of available Programs 

We briefly mention some of the parton shower-related features of the different MC 
programs that are available. 

Older programs that are still sometimes seen but not supported any more: 

• Pythia 6.2 [24]: traditional q 2 ordering, veto of non-ordered final state emis- 
sion, partial implementation of angular ordering in initial state, big range of hard 
processes. 

• HERWIG [25]: complete implementation of colour coherence, NLO evolution 
for large x, smaller range of hard processes. 

• Ariadne [26] : complete implementation of colour-dipole model, best fit to HERA 
data, interfaced to Pythia for hard processes. 

Supported and new programs: 

• Pythia 6.3 [27]: /^-ordered parton showers, interleaved with multi-parton inter- 
actions, dipole-style recoil, matrix element for first emission in many processes. 

• Pythia 8 [28] : new program with many of the same features as Pythia 6.3, many 
'obsolete' features removed. 

• Sherpa [29] : new program built from scratch: /^-ordered dipole showers, multi- 
jet matching scheme (CKKW) to AMAGIC++ built in. 

• Herwig++ [30]: new program with similar parton shower to HERWIG (angular 
ordered) plus quasi-collinear limit and recoil strategy based on colour flow, spin 
correlations. 

In addition, dipole showers are available as optional plug-ins to both Herwig++ [31] 
and Pythia [12]. 



2.11 Summary 

The basic idea of parton showers is very simple: accelerated colour charges radiate 
gluons, but since the gluons themselves are also charged we get an extended cascade 
developing. This cascade is modeled as an evolution downward in momentum scale. 
As we approach the non-perturbative limit, we get more and more radiation and 
the phase space fills with soft gluons. The probabilistic language is derived from 
factorization theorems of the full gauge theory. Colour coherence is a fact of life: do 
not trust those who ignore it! 

Modern parton shower models are very sophisticated implementations of pertur- 
bative QCD, but they would be useless without the hadronization model, which will 
be discussed next. 
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Everything we have studied so far was based on perturbative QCD but partons are 
not the final state particles that come out of the collision as they cannot propagate 
freely. We know that hadrons are the physical final state particles, but we do not 
know how to calculate them, so we need a model to describe how partons are con- 
fined into hadrons - this is called hadronization. The two main models in use are 
the String Model, implemented in Pythia, and the Cluster Model, implemented in 
Herwig and S her pa. These models will be described in more detail later but we 
will first look at some of the physics behind hadronization models and how they 
have developed. 



3.1 Phenomenological Models 

We start with an experimental observation, namely e + e~ annihilation to two jets, 
which is the majority of hadronic e + e~ events, and study the distribution of hadrons 
with respect to the axis formed by the two jets. You can measure the rapidity, y, 
and the transverse momentum, p u of hadrons relative to that axis. If you plot the 
number of hadrons as a function of y, sketched on the left in Figure 9, you find that 
it is roughly flat up to some maximum value and then falls off very quickly. However 
looking at the right side of Figure 9 where the number of hadrons is sketched as a 
function of p t , it can be seen that this distribution is roughly Gaussian with a narrow 
width of 1 or 2 GeV. This means that most hadrons are produced with very low p t . 




Y y p t 

Fig. 9 The number of hadrons sketched as a function of rapidity, y, and transverse momentum, p t . 



One can make a very simple model based on this observation and estimate the 
hadronization correction to perturbative quantities. The energy of the jet is 



rY 

E= dy d 2 p t p (pj) coshy = AsinhF, 

Jo 



(26) 
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where Y is the maximum rapidity of hadrons in the jet and X is their mean transverse 
momentum, given by 

X = Jd 2 p t p(pj)p t , (27) 

which can be estimated from Fermi motion where X ~ l/Rhad ~ m had- The longi- 
tudinal momentum can be calculated in the same way: 

rY 

P= dyd 2 p t p(p 2 )sinhy = X(coshY-l)~E-X. (28) 

Jo 

The jet acquires a non-perturbative mass, given by 

M 2 =E 2 -P 2 ~2XE, (29) 

from which it can be seen that the non-perturbative invariant jet mass is proportional 
to the square root of its energy. This non-perturbative component is an important 
contribution, e.g. a 10 GeV contribution for 100 GeV jets. Since these corrections 
are so important, we need a precise model to predict them. 

The first set of models that were developed to describe hadronization were the so- 
called Independent Fragmentation Models (or "Feynman-Field" models) and they 
are a direct implementation of the procedure described above. The longitudinal mo- 
mentum distribution is an arbitrary fragmentation function, a parametrization of 
data. The transverse momentum distribution is assumed Gaussian and the model 
just recursively applies q — )> q' + had and hooks up the remaining soft q and q until 
the whole jet is hadronized. This model can describe e+e~ — » 2 jet events by con- 
struction, but has a lot of disadvantages: it is strongly frame dependent, there is no 
obvious relation with perturbative emission, it is not infrared safe and it is not a 
model for confinement. 



3.2 Confinement 

We know that in QCD we have asymptotic freedom: at very short distances a qq pair 
becomes more and more QED-like, but at long distances the non-Abelian gluon self- 
interaction makes the field lines attract each other, as sketched in Figure 10. As two 
colour charges are pulled apart the field lines do not spread out and we get a constant 
force or a linearly rising potential. One would have to invest an infinite amount of 
work to pull them apart - this is the signal of confinement. 

This interquark potential (or string tension) can for example be measured from 
quarkonia spectra, as shown in Figure 1 1, or from lattice QCD. The string tension K 
is found to be roughly 1 GeV/fm. 
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Fig. 10 Field lines in QED (left) and QCD (right) between a charge and an anti-charge. 
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Fig. 11 Charmonium and bottomonium spectra, the nS (n = 1,2,...) energy levels are roughly 
equally spaced (in more detail, they go like ~ n 2 / 3 ). Reproduced from [4]. 



3.3 String Model 



The first step in understanding the structure of hadrons is to take this string picture 
very literally and to look at the space-time diagram of a meson (qq), outlined in 
Figure 12. 

At some point in time the quark and the anti-quark (both assumed massless) are 
at the same point in space and they are flying apart at the speed of light, hence up 
a 45° diagonal in the space-time diagram. As they grow further apart they lay out 
a string between them that has a constant tension until the potential stored in this 
string uses up all their kinetic energy. At this point they turn around and the potential 
energy in the string accelerates the quarks towards each other until they meet back at 
the starting point, pass through each other, and the whole process starts over. These 
are the so-called "yo-yo" modes. 

It is a nice exercise in Lorentz transformations to think about what this process 
looks like when you are not in the rest frame but in a boosted frame, e.g. when 
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Fig. 12 Cartoon of string model, quark and antiquark moving apart from each other as seen in the 
meson rest frame (left) and in a boosted frame (right). 



the meson is moving to the right. The two points at which the quarks' directions 
reverse are not simultaneous and the string spends part of its cycle simply moving 
to the right, without expanding or contracting, and transferring momentum from one 
quark to the other. You can measure the speed of the meson from the slope of the 
line of its centre of mass. The area of these squares and rectangles is the same in 
both frames, i.e. it is Lorentz invariant, and obeys the area law 

m 2 = 2k 2 area. (30) 

The Lund String Model uses this picture as a model of hadronization. In the 
original, simple version of the model we start by ignoring gluon radiation. e + e~ 
annihilation is then a point-like source of qq pairs. In principle this system also 
has yo-yo modes, but in practice the space-time volume swept out is so large that 
another effect is able to dominate: in the intense chromomagnetic field of the string 
between the qq pair it is possible for additional qq pairs to spontaneously tunnel 
out of the vacuum. The chromomagnetic field is strong enough that the quarks are 
accelerated away from each other before they have time to re-annihilate 9 . The effect 
is that the string separates into two strings: it breaks. By analogy with a similar 
process in QED you can estimate the probability of this happening, as 

d(Probability) -Km 2 

d* d , ~ " ex p ^- (31) 

The mass dependence of this equation means that, for example, strange quarks will 
tunnel out less often than light quarks. 

When we have a lot of energy available it is likely that we will produce many 
hadrons, as the expanding string breaks into mesons long before the yo-yo point. 
Thus the original qq system has fragmented into a system of hadrons - this is the 



' This is somewhat analogous to Hawking radiation. 
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basic ingredient of the Lund String model illustrated in Figure 13. The space time 




Fig. 13 Cartoon of the string corresponding to a high energy qq event breaking up into hadrons. 



structure of this breakup is very interesting, as the breaks are causally disconnected, 
so they don't know about each other and there can be no causal correlations be- 
tween them. The Lorentz invariance and acausality give strong constraints on this 
hadronization process. In the end, we get a fragmentation function for hadrons with 
a constrained form with two adjustable parameters, a a and ap , 



f(z)<xZ aa - a V-\l-z) a V. 



The tunnelling probability then becomes 



exp-b(m 2 q +pf] 



(32) 



(33) 



where the main tuneable parameters of the model are a, as described above, b, cus- 
tomarily called the "Lund b parameter" related to the string tension, which can be 
seen to control the width of the p t distribution, and m 2 q , the masses of the individual 
quarks. 

An important new feature of this model, relative to independent fragmentation, 
is that it is universal and predictive: having been tuned to describe quark events it 
can predict the confinement of gluons. This is again related to the colour structure. 
In a three parton (qqg) system, the quark is colour-connected to the anticolour index 
of the gluon and the colour index of the gluon is connected to the antiquark. Thus 
the gluon makes a corner, or "kink", on the string. The acausality means that the 
breakup of the string is universal, but the Lorentz boost of a string means that the 
hadrons it produces go preferentially in the direction of its motion. Therefore most 
hadrons that the first string segment produces will go between the quark and the 
gluon, most hadrons from the second will go between the gluon and the antiquark 
and only very few hadrons will go between the quark and the antiquark. 
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Fig. 14 String structure for qqg (left) and qqy (right) events. 



This definite prediction of the string model is known as the string effect and can 
be seen experimentally, e.g. at the PETRA and LEP experiments, by comparing 3- 
jet events to 2-jet + photon events, which can be represented as in Figure 14, where 
hadrons prefer to be between the quark and the antiquark. 

In this model, there is a smooth matching with the parton shower, since a soft 
gluon with k± smaller than the inverse string width will have no effect on the 
hadronic final state. 

In summary, the string model has a very strong physical motivation. It is a model 
of confinement, unlike the earlier independent fragmentation models. It is universal 
and gives the best description of data. However, for many of the effects for which it 
gives a strongly motivated qualitative prediction, in practice its quantitative predic- 
tion depends on free parameters that can be tuned to data. The smooth matching to 
the parton shower can also be seen as a disadvantage if one wishes to learn about 
the perturbative phase of QCD evolution as it, in a sense, can cover up the precise 
information from the parton shower. This motivated people to think of a new model 
which will be discussed in the following sections. 



3.4 Preconfinement and the Cluster Model 



In the planar, or large N c , approximation, a gluon is a colour- anticolour pair. One 
can follow the colour structure of the parton shower and find for each external parton 
its colour partner to which it is colour connected. One finds that these colour- singlet 
pairs tend to end up close in phase space. The mass spectrum of colour- singlet pairs 
is asymptotically independent of energy or the production process and is peaked at 
low mass ~ go- It depends on go and A, but not the shower scale Q. This property is 
known as preconfinement and is the inspiration for the cluster hadronization model. 
The cluster model is motivated by thinking about the spectrum of mesonic states 
constructed from given quark and antiquark flavours. The lightest states are narrow, 
but the heavier ones are broad resonances - above 1.5 GeV or so one can picture a 
continuum of overlapping states of different spins. One can then think of the colour- 
anticolour pairs of preconfinement being projected directly onto this continuum. 
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We call them clusters. These decay to lighter well-known resonances and stable 
hadrons. Once we have summed over all possible spins for a given process we ef- 
fectively wash out all of the spin information and this assumption tells us that the 
decay should happen according to pure phase space. One immediate consequence 
of this is that heavier hadrons are suppressed - you get baryon and strangeness sup- 
pression "for free" (i.e. they are untuneable). The hadron-level properties are then 
fully determined by the cluster mass spectrum, i.e. by perturbative parameters, go 
is therefore a crucial parameter of the model. 

This naive cluster model works well for the bulk of colour singlet states but, 
although the cluster mass spectrum is peaked at small mass, there is a broad tail 
to high masses. A small fraction of clusters are too heavy for isotropic two-body 
decay to be a reasonable approximation. In the cluster fission model, these high mass 
colour-anticolour pairs split into two lighter clusters in a longitudinal, i.e. rather 
string-like, way. The fission threshold becomes another crucial parameter for tuning 
the cluster model as, although only ~ 15% of primary clusters get split, ~ 50% of 
hadrons come from them. 

The cluster model was found to describe data reasonably well, with far fewer 
parameters than the string model. However, although it was found to work well 
for the majority of hadrons, it was noticed that the leading hadrons were not hard 
enough. This was cured, at the expense of an additional parameter, by saying that 
perturbatively-produced quarks remember their direction somewhat, with probabil- 
ity distribution 

/>(eVex P -e 2 /e 2 , (34) 

so that this cluster fragments more along its own axis. This again is not completely 
isotropic but more string-like as it remembers the direction along which the colour 
is expanding. It also has more adjustable parameters to fit the data. 

The founding philosophy of the string and cluster model are quite opposite. The 
cluster model emphasizes the perturbative phase of parton evolution and postulates 
that if this is correctly described, "any old model" of hadronization will be good 
enough. The string model emphasizes the non-perturbative dynamics of the confine- 
ment of partons and started initially from a very simple treatment of the production 
of those partons. The accumulation of more precise data have led the models to 
converge to something more similar. The string model has had successively refined 
perturbative evolution and the cluster model has become successively more string- 
like. This leads one to wonder whether nature is pointing us towards a model in 
which the flavour mix is largely determined by the perturbative dynamics, as in the 
cluster model, and their distributions largely determined by non-perturbative string 
dynamics. 

We close this section by commenting briefly on the universality of hadroniza- 
tion parameters. With so many free parameters, one might question the predictive 
power of these models. However, one finds in practice that the parameters are uni- 
versal: that a single set of parameters describes the data at a wide range of ener- 
gies and processes. One can show that this is a consequence of preconfinement: 
the perturbative production and evolution of partons takes care of the process- and 
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energy-dependence and the transition from partons to hadrons is a local process, in- 
dependent of these factors. Thus, hadronization models tuned to e^e~ annihilation, 
and lower energy hadron collider data, are highly predictive for LHC events. 



3.5 Secondary Decays 

An often underestimated ingredient of event generators is the model of secondary 
particle decays. This is more important than often realized because in both the string 
and the cluster models it is rare that the clusters decay directly to the stable pions 
and kaons seen in the detector. Mostly they decay to higher resonances which then 
decay further. One might say that these decays have been measured and one can just 
"use the PDG", but often not all resonances in a given multiplet have been mea- 
sured, and rarely do the measured branching fractions add up to 100% or respect 
expected symmetries such as isospin exactly. So when these data tables in the MC 
are built, a lot of choices need to be made. Moreover, in the case of multi-body 
decays, the matrix elements are highly non-trivial and appropriate models have to 
be constructed for them. The decay tables, and the decay models that implement 
them, actually have a significant effect on the hadron yields, transverse momen- 
tum release and hadronization correction to event shapes. The choice of decay table 
should therefore be considered as part of the tuned parameter set and a change of 
decay tables should be accompanied by a re-tune. 



4 Underlying Event 

The preceding steps of hard process, parton shower, hadronization and secondary 
decays are sufficient to fully describe the final state of the hard process, in which 
a high energy parton from each incoming hadron interact to produce an arbitrarily 
complex final state. However, this process involves the extraction of a coloured par- 
ton from each of the hadrons, which are colourless bound states of many coloured 
partons. We therefore have to consider how the hadron remnants evolve, hadronize 
and, potentially, interact with each other. 

In a proton's rest frame, it is a spherically symmetric extended object. Therefore 
in the laboratory frame where two protons collide at high energy, they look like 
extremely flattened discs due to Lorentz contraction, as shown in Figure 15. Internal 
interactions are also extremely time dilated, so during the time that the discs overlap 
the protons' internal dynamics are effectively frozen. On the one hand, this means 
that a high energy interaction is extremely localized, and the whole of the parton 
shower and hadronization of the primary interaction happens in a very small space- 
time region and there is not time for it to be affected by the rest of the proton. On 
the other hand, it means that there is a very large overlap between the other partons 
in the protons and the possibility of additional interactions. 
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Fig. 15 Sketch of a proton-proton collision showing the Lorentz contraction of the protons. 



Historically, there are two main models that have been used. Even though the 
first is effectively ruled out by Tevatron and LHC data, it is still useful to discuss, to 
draw out the contrasting features of the second, more successful, model. 

The non-perturbative model is motivated by the fact that the soft parton-parton 
cross section is so large that there are many interactions everywhere in these discs 
and the assumption that these interactions are coherent across the discs. Thus the 
whole of one remnant interacts non-perturbatively with the whole of the other rem- 
nant. In the absence of an understanding of non-perturbative dynamics, our best 
hope is to parametrize data on these interactions. The only predictivity comes from 
the assumption that the underlying event at a given energy is independent of the hard 
process it underlies. This model was the default in HERWIG and is made available 
as an option in Herwig++ as it is still interesting to have a "straw man model" of 
soft hadronic interactions without any hard component. However, all the models that 
successfully describe the LHC data have a perturbative origin. 

In the perturbative models, the idea is that the perturbative parton-parton cross 
section is so large that additional local parton-parton interactions between other 
partons in the proton dominate. We do not therefore have a coherent scattering but 
multiple independent parton-parton interactions distributed across the disc, each 
producing their own hard processes and parton showers as well as the initial one 
that we started with. 

The underlying event is closely linked with what are often called "minimum bias" 
events. These are the final states of a typical proton-proton collision and typically 
consist of a small number of hadrons at low transverse momentum distributed across 
a wide range of rapidities. Although the name "minimum bias" is widely used it is 
important to keep in mind that this is an experimental statement. By "minimum" we 
mean "as little as possible" so the amount of bias is dependent on the experiment. 
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We would like to compare them with models that we think of as having zero bias, 
that are predicting all inelastic proton-proton collisions. To avoid confusion about 
the experiment-dependence of any minimum bias definition, the recent recommen- 
dation [3] is to describe the event class as "soft inclusive" events, reserving the name 
"minimum bias" for experimental attempts to measure these events. 

In analysis, people often assume that they can remove the effect of the underly- 
ing event by measuring soft inclusive events and then subtracting these off, as the 
features of the two are very similar. This works reasonably well as a first approx- 
imation but if you look into the details, fluctuations in the amount of underlying 
event and correlations between the underlying event and the measured jets are ex- 
tremely important. Making this assumption can potentially be quite dangerous and 
it is possible to underestimate the size of underlying event corrections. 

Most jet cross sections are very steep, typically falling like the 5 th or 6 th power 
of p t . If jets get a little extra energy from the underlying event their distribution 
gets shifted sideways, but since the distribution is so steeply falling a small shift 
sideways corresponds to a very large upwards or downwards shift in the curve. So 
a small contamination from the underlying event can give a large change in the jet 
production rate with given kinematics. This means that jet cross sections are sensi- 
tive to rare fluctuations in the underlying event and just subtracting off an average 
amount of underlying event is not necessarily meaningful. Processes with different 
p t distributions will have different underlying event corrections - the steeper the p t 
distribution is, the more a jet sample will be populated by lower p t jets that have 
been shifted up by rare fluctuations in the underlying event. It is therefore extremely 
important to have reliable underlying event models that can predict this 10 . The way 
to avoid this trouble is to not tune to the average amount of underlying event but 
to correct on an event-by-event basis, this way fluctuations and correlations will be 
better taken into account [32]. 



4.1 Multiparton Interaction Model 

The starting point for the perturbative model is the observation that the hard parton- 
parton cross section is so large that we have many parton-parton collisions in one 
proton-proton collision. This is demonstrated in Figure 16, which shows three 
curves using different PDF sets, with their different a? values. The total proton- 
proton cross section predicted by three different models is shown for comparison 11 . 



10 The underlying event itself is not usually assumed to be correlated with the process but there 
is a trigger bias - if you look at jets in a given kinematic range the distribution of the primary 
jets determines how much they are affected by the underlying event. For example, in Z and ZZ 
production not much difference in underlying event is expected, but because the ZZ process has a 
harder p t distribution of accompanying jets, it is less affected by the underlying event. 

11 This figure was made for 14 TeV centre-of-mass energy and before the LHC measurements of 
the total cross section. The measured values at 7 and 8 TeV are closest to those predicted by the 
model labelled as "DL+CDF" in the figure. 
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Fig. 16 Cross section at 14 TeV as a function of minimum p t for different PDF sets. Reproduced 
from [33]. 



For small p t , m in an d high energy the inclusive parton-parton cross section is larger 
than the total proton-proton cross section allowing more than one parton-parton scat- 
ter per proton-proton collision [34]. From PDFs calculated from deep inelastic scat- 
tering measurements, we know the distribution of momentum fractions of partons 
in the proton. What needs to be added is a model to describe the spatial distribution 
of partons within a proton. This is the only additional non-perturbative ingredient 
we need: with this we can calculate the distribution of number of scatters there are 
per proton-proton collision. 

For these matter distributions, the current models usually make the assumption 
that x and b factorize: 



ni(x,b\iL 2 ,s)=fi(x\iL 2 )G(b,s), 



(35) 



with fi(x\jl 2 ) the usual (inclusive) parton distribution functions, and that ft-parton 
distributions are independent: 



Hi j (x t , Xj , bt ,bj)= n t (x t , bt ) rij (xj ,bj) , 



(36) 



etc. In these approximations, the number of scatters at a fixed impact parameter, 
i.e. with a given overlap between the two protons, is then given by a Poisson distri- 
bution. This can be integrated over impact parameter to calculate the ^-scatter cross 
section: 
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(A(ft)ff* 



(37) 



with 



a„ = /d V AW J ' exp(-AWa-), 

A(fe)= / > d 2 /?iG(/?i)d 2 /?2G(/?2)5 (2) (b-bi+b 2 ). (38) 

These ingredients are sufficient to generate a number of scatters and their kinemat- 
ics and parton showers. The one remaining complication is how the colour of the 
different scatters is connected to each other. This has been studied by the Pythia au- 
thors in some detail [35] and recently also by the Herwig authors [36,37]. Figure 17 
shows two pp events for which the hard processes (gg — > qq and qg — )> qg) have ex- 
actly the same colour structure but the colour connections between the scatters and 
the external protons is different in the two cases. Although the parton showering 
will be identical in the two cases, the hadronization will differ, because the string 
connections (represented by the dashed lines in the right-hand part of each figure) 
differ. 





(a) (b) 

Fig. 17 Example of colour correlations for two pp events with the same hard scatter colour struc- 
ture. Reproduced from [3], adapted from [35]. 



Although perturbation theory can specify everything about the colour connec- 
tions in the centre of the event, it doesn't tell us how these colours are hooked into 
the wavefunction of the original protons. So we need to supplement our model. The 
Pythia authors have studied different algorithms to do this in some detail and iden- 
tified experimental observables that help to constrain them [38]. 



4.2 The Herwig++ Multiparton Interaction and Colour 
Reconnection Models 

The multiparton interaction model in Herwig++ is developed from the one avail- 
able as a plug-in to HERWIG, called Jimmy [39], but with a number of new fea- 
tures [36,40] . The idea is to use the eikonal model and optical theorem to connect the 
partonic scattering cross sections to the total, inelastic and elastic hadronic cross sec- 
tions. A simple separation of partonic scatters is made into hard (above p tl min ~ 3- 
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5 GeV), distributed perturbatively around 'hot spots' of high parton density in the 
protons, and soft (below p tl min), with a simple distribution with Gaussian 12 trans- 
verse momentum and valence-like momentum fraction distributed across the whole 
of the protons' radii. Once the total cross section and elastic form factor are fixed 
by data there are only two free parameters: Pt, m in an d the effective hot spot radius. 

When first implemented, this model gave a good description of underlying event 
data, but failed badly for soft inclusive analyses. It was realized that this was due to 
the issue of colour correlations between the scatters, which was not very carefully 
handled in the first implementation. Rohr, Siodmok and Gieseke have implemented 
a new model of colour reconnections in Herwig++ [37] based on the momentum 
structure. This also gives reconnection effects in e + e~ annihilation, so a refit of 
LEP-I and LEP-II data is necessitated, but the conclusion is that one can get a good 
fit of the e + e~ and LHC data, see for example Figure 18 where the red line is 
the best fit to the data without colour reconnections and is clearly nothing like the 
data. An important conclusion of this study is that the hadronization parameters are 
correlated with the reconnection probability - changing one necessitates a retuning 
of the other, but a good fit can be obtained for a wide range of colour reconnection 
probabilities. 
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Fig. 18 Comparison of Herwig++ 2.4.2, without colour reconnections, and Herwig++ 2.5, with 
colour reconnections, to ATLAS minimum-bias distributions at y/s = 0.9 TeV with N c h > 6, 

p t > 500 MeV and |tj | < 2.5, reproduced from [37]. 



12 In fact, once the parameters are fixed, an interesting feature emerges: the width-squared of the 
Gaussian is forced to be negative, giving an "inverted Gaussian" with very few events at very low 
transverse momentum and a concentration of events around pt, m in- This lends support to the multi- 
parton interaction model and the idea that the entire cross section could be described perturbatively 
at high energy. 
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4.3 Pythia implementation 

Pythia was the first event generator to incorporate multiparton interactions and its 
implementation is very well developed. An interesting feature that has emerged, that 
is not shared by Herwig or Sherpa, is the possibility that through colour connection 
effects there can be an interplay between multiparton interactions and the initial- 
state parton shower [41]. Starting with a single hard interaction at some value of p t , 
the simulation evolves downwards in p u with the possibility at every step of either 
generating an emission from an incoming parton or an additional scatter. At any 
point during this evolution there can be colour cross-talk between these different 
interactions and this will affect the distribution of hadronization and of the partons 
that are radiated by these multiple scatters. 

A recent study [42] also considered rescattering, where two partons out of one 
proton can scatter with the same parton from the other. This is suppressed by the 
fact that it must be a local process: the two partons in one proton must both be 
overlapping with the other. Nevertheless it does give another contribution to the 
fluctuations in the underlying event. 

Most existing models make the simplifying assumption in Equation (35) of fac- 
torization of x and b, i.e. of the momentum and spatial distribution of partons. In an- 
other recent study [43], Corke and Sjostrand implemented a model without this as- 
sumption by considering a Gaussian matter distribution with an x-dependent width, 

a(x) = ao ( 1 + 01 In - ) , (39) 

and looked at what you could learn about these parameters from the data. The ef- 
fect is to start producing more correlation with the underlying event. A higher mass 
final state is produced at higher x, therefore it has a narrower matter distribution 
and more underlying event. There is a correlation between the momentum used in 
the hard collision and the underlying event that accompanies it. They compared the 
underlying event in Z events and in events that produce a Z' of 1 TeV and found sig- 
nificant differences. They could obtain equally good fits of the existing underlying 
event data but with significant differences in their extrapolation to higher- scale pro- 
cesses. This is clearly something that requires further study, to improve the models 
and to understand the uncertainty they introduce in high-mass searches using jets, 
for example. 



4.4 Underlying Event Measurements 

Despite a ~ 25 year history, many aspects of our understanding of multi-parton in- 
teractions are still in their infancy. The Tevatron and especially LHC experiments 
have already opened up huge areas for further study, not only with a big step up 
in energy but also with much higher efficiency, purity and phase space coverage 
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than the previous measurements. There has also been a big change in the culture 
around the measurements, with an emphasis on physical (experiment-independent 
and generator-independent) observables that can be directly compared between ex- 
periments and with a wide variety of models, now or in the future. There is also a 
move towards making more targeted measurements of observables that are sensitive 
to specific physical effects, such as colour reconnections. The general conclusion 
is that all the existing models can describe the general underlying event and soft 
inclusive data well with tuning. The emphasis is moving towards understanding of 
correlations between hard and underlying events, rare corners of phase space (such 
as high multiplicity soft events) and the relationships between different model com- 
ponents. One of the main motivations for these studies is the fact that jet corrections 
depend strongly on these correlations, and high moments of distributions, and are 
physics-process dependent. A deeper understanding, and greater predictivity, is still 
needed. 



5 Summary 

As a summary of our discussion of event generators, we recall the main subjects that 
we have covered, commenting on how well they are understood from first principles. 
We briefly touched on the hard process which is generally a direct implementation of 
tree-level perturbation theory and hence extremely well understood. We discussed in 
detail the parton shower which is an approximation to all-order perturbation theory 
and therefore in principle well understood. Various approximations are made in con- 
structing parton showers and the effect of these is not always as small as anticipated. 
The cutting edge here is the matching between higher order fixed-order perturbation 
theory and parton showers, which should, in principle, be fully understandable from 
perturbation theory, but is at present the subject of some uncertainty. We then talked 
about hadronization which is less well understood from first principles. Although 
there are different models, they are well constrained by data and the extrapolation to 
LHC data is considered to be fairly reliable. Lastly, the underlying event is the least 
well understood out of all these. It is only weakly constrained by previous data and 
different models that fit the available data give quite different extrapolations. More- 
over, it is important to recall that correlations and rare fluctuations in the underlying 
event are as important as its average properties and are even less well tied down. 

Monte Carlo event generators are increasingly used as tools in almost every as- 
pect of high energy collider physics. As the data become more precise it becomes 
increasingly important not to use them as black boxes, but to question how reliable 
they are for the application at hand. The important question to ask is "What physics 
is dominating my effect?". We hope that these lecture notes have helped equip the 
reader to answer this question. 
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